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CLj ' Abstract 

^ ■ 
■^—> , The constant rate birth-death process is a popular null model for speciation and extinction. 

If one removes extinct and non-sampled lineages, this process induces 'reconstructed trees' which 

describe the relationship between extant lineages. We derive the probability density of the length 

of a randomly chosen pendant edge in a reconstructed tree. For the special case of a pure-birth 

process with complete sampling, we also provide the probability density of the length of an interior 



> 

(N 

0^ ' edge, of the length of an edge descending from the root, and of the diversity (which is the sum of all 



edge lengths) . We show that the results depend on whether the reconstructed trees are conditioned 
on the number of leaves, the age, or both. 
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1 Introduction 

The constant rate birth-death process is a widely- used null model for speciation and extinction [TTl [T^] . 
This model has been used to test the hypothesis of constant macroevolutionary rates and to quantify 
the rates of speciation [TSl [IHl HSl [21] ■ Despite its wide use, the process continues to reveal new and 
sometimes unexpected results: even the simple Yule model leads to a curious property highlighted 
in a recent paper [531 [IH]: the expected length of a randomly chosen edge in a Yule tree is half of 
the expected waiting time until a speciation event occurs. In order to attribute such "surprises" in 
empirical data to the null model instead of trying to find further explanations, the null models need 
to be well understood. In this paper, we fully characterize the lengths of pendant edges in birth-death 
trees on extant species (so-called reconstructed trees) , and improve our understanding of interior edge 
lengths in Yule trees. 

We will first explain the concept of a reconstructed tree which was originally introduced in J14j 
(see also Fig. [TJ. The birth-death process starts with a single species at time xq before the present. 
At all times until the present, each species has a constant rate A of speciation and a constant rate jl 
of extinction (with Q < jl < A). Such a process induces a birth-death tree. At the present, each extant 
species is sampled with probability /. Throughout this paper, we prune extinct and non-sampled 
species in the birth-death tree, i.e. we consider the birth-death tree which is induced by the sampled 
extant species. The tree without the extinct and non-sampled species is called the reconstructed tree, 
as empirical data typically infers this reconstructed tree (unless fossil information is included). 

We consider three different scenarios for stopping the process (i.e. defining the present): 

• Scenario (i): we condition the process on having n extant sampled species, or 

• Scenario (ii): we condition the process on having n extant sampled species and age xi for the 
most recent common ancestor of the extant species, or 

• Scenario (iii): we condition the process on having age xi for the most recent common ancestor 
of the extant species. 

As the start of the process (the time of origin xq or the time of most recent common ancestor 
Xi) is a parameter of the birth-death model, we have to assume a prior distribution for the time of 
origin when not conditioning the reconstructed trees on its age (Scenario (i)). We make the common 
assumption that the first species originated at any time xq in the past with uniform probability [T]. 



This is also called an improper prior on (0, oo). Conditioning the resulting reconstructed tree to have 
n extant species yields a proper distribution for the time of origin [S] . 

A special case of the birth-death process is the Yule model [23] , which is obtained by setting /i = 
and / = 1. Under the Yule model, Scenario (i) is equivalent to stopping the process just before the 
(n + l)-th speciation event [7j. 





(b) 

Figure 1: (a): A birth-death tree starting with a single lineage, and with extinct lineages ending in 'x'. 
The time from the extant lineages to their most recent common ancestor is xi. (b): The 'reconstructed 
tree' obtained by deleting the initial single lineage and all the extinct and two non-sampled extant 
lineages (namely the two labelled 3 and 4). 



Remark 1.1. In [50], it is shown that a birth-death process with parameters A, /t, / under Scenario (i) 
or (ii) induces the same distribution on reconstructed trees as a birth-death process with parameters 
A, /i and complete extant species sampling, where: 

A = /A,M = /i-A(l-/). 



Thus, under Scenario (i) and (ii), we will state all birth-death model results as functions of the 
transformed variables A, /i and complete sampling. We will further establish in this paper that the 
transformation also holds under Scenario (iii). 

In the following, we derive the probability density of the length of a randomly chosen pendent edge 
in a reconstructed tree generated by a constant rate birth-death process under Scenarios (i), (ii), and 
(iii). For the special case of a Yule model (with rate A) under Scenario (i), we find that a randomly 
selected pendant edge (or interior edge) has an exponentially distributed length with parameter 2A 
(Corollarv l3.2l and Theorem 13. 3p . This result generalizes [23], where the expected length of a pendant 



edge was calculated to be 1/(2A). For the Yule model (under Scenarios (i)-(iii)), we also derive the 
probability density of the length of an edge descending from the root and the sum of all edges. 

2 Preliminaries 

We first present some notation and a preliminary result that will be useful later. A reconstructed tree 
on n extant species has n — 1 interior vertices at times xi > . . . > Xn-i in the past (see Fig. [2]). We 
call the speciation event at time a;^ the /c-th speciation event. We say that a leaf a; of a reconstructed 
tree is adjacent to the fc-th speciation event if {v^x) is an arc of the tree, where v is the vertex that 
corresponds to the /c-th speciation event. For example, in Fig.[2]the left-most leaf of the tree is adacent 
to the second speciation event. In [19] . the following result was established; we will provide a shorter 
and more direct proof here. 




Figure 2: For fc = l,...,n — 1, a;^ denotes the time from the present into the past when the fc-th 
speciation event occurred in a reconstructed birth-death tree. The two edges incident with the root e' 
and e" (considered in Section H]) are also shown, with e' the shorter of the two. 



Theorem 2.1. For a reconstructed Yule or birth-death tree on n > 2 extant species, the probability 
under Scenario (i) or (ii) that a randomly- selected leaf is adjacent to the k-th speciation event is: 



v{k) 



2k 



n(n — 1) 



(1) 



Proof. Our proof is based on the equivalence of the probability distributions on tree topologies under 
three models - the Yule process, the reconstructed birth-death tree and the coalescent tree (if one 
ignores branch lengths) [2]. Observe that leaf L is adjacent to the fc-th speciation event vertex in the 
Yule tree if and only if L fails to coalesce for the first n — fc — 1 coalescence events, but does so on the 



next coalecence event (when fc + 1 points arc available to coalesce and k of these possible ( ^ ) pairs 
involve L). The probability of this, under the coalescent process, is precisely: 



n — k — l 

n 



n — ■] 
1 ■' 



m 



m 



Now, expansion and simplication (cancellation) of this product yields v{k) — ^r^_i\ ■ D 

The following expressions will be useful later. For < IJ. < \, we define, 

(i_e-a-<^)-) 

Po(s) := S (2) 

if /i = A; 



l+As' 



and 






if /i < A: 

Pi(s) :- r-^""^ "" ^'"^"' (3) 

if ^ = A. 



1 

(l + As)^- 



The significance of these quantities is that the probability of a lineage producing (resp. 1) offspring 
after time s is iJ,po{s) (resp. Pi(s)) [S]. Note that, for Yule trees, we have: 



As\ /\. „„ 1 „ /„\ -As 



Po{s) = (1 - e-^")/A; and pi(s) = e 

3 Length of a pendant edge in birth— death and Yule trees 

In this section, we calculate the probability density function (pdf ) of the length of a random pendant 
edge in a reconstructed tree. For Scenario (i), we use this to derive the expected length of such an edge; 
moreover we calculate the pdf of the length of a random interior edge for a Yule model. For Scenario 
(ii), we use the pdf to calculate the expected length of a random pendant edge. For Scenario (iii), we 
additionally establish a general transformation equivalence between models (Theorem 13. 9p . 

(i) Conditioning on n 

For pendant edges, the following result was established in [lOj . 



Theorem 3.1. The length of a randomly selected pendant edge in a reconstructed birth-death tree on 
n extant species has probability density function: 

fp{s\n)^2\p,{s)il-Xpo{s)), 

and expectation, 

= /^+(A-M)log(l-/./A)_ 

Here we note the following direct consequence of this result for the Yule model (obtained by setting 
M = 0). 

Corollary 3.2. The length of a randomly picked pendant edge in a tree on n leaves under the Yule 
model with rate A is exponentially distributed with rate 2A. 

We now establish a corresponding result for a randomly selected interior edge in a Yule tree (again 
under Scenario (i)). 

Theorem 3.3. The length of a randomly selected interior edge in a Yule (rate \) tree on n leaves is 
exponentially distributed with rate 2A. 

Proof. We will establish this theorem by induction on n. For n = 3, we have one interior edge. The 
waiting time between the first speciation event (yielding two species) and the second speciation event 
(yielding three species) is the length of the interior edge. This waiting time is the time until the first of 
the two species speciates. As each species has a speciation rate A, this waiting time is an exponential 
distribution with rate 2A. 

Now assume we established the theorem for n = k. For establishing the theorem for fc + 1, we first 
note that the Yule tree on fc + 1 species with the uniform prior for the time of origin corresponds to a 
Yule process forward in time which is stopped just before the (fc + l)-th speciation event [7]. Now cut 
off the tree on fc + 1 leaves Tk+i at time Xk, yielding a Yule tree on fc leaves Tk- A randomly selected 
nterior edge Tk has an exponentially distributed length with rate 2 A (induction assumption). Each 
interior edge in Tfe is also an interior edge in Tk+i- Additionally, in Tk+i, one of the pendant edges 
becomes an interior edge (the edge which speciates at time Xk)- As each randomly selected pendant 
edge in Tk has an exponentially distributed length with rate 2 A fCorollarv l3.2p . the new interior edge 
in Tk+i also has an exponentially distributed length with rate 2A. This yields that a randomly picked 
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interior edge in 7fe+i has an exponentially distributed length with rate 2A, and thereby establishes the 
induction step. D 

(ii) Conditioning on n and Xi 

In order to derive the edge length distribution, we need the following lemma. 

Lemma 3.4. The probability density function for the time of the k-th speciation event in a recon- 
structed birth-death tree, conditional on having n > 2 extant species and on the first speciation event 
being at time xi is: 



/„,fc(s|n,Xi) = (n-2)(''^_^')G(s|xi)' 



^ 1 /'l /^/ ri I ^ \\f*^ 2 



(l-G(s|xi))^-^5(s|xi),fc = 2,...n-l, 



where 



9{s\xi) = — -. — r, G{s\xi)- 



Po{xi)' PoixiY 

with Pq and pi the functions described in Eqns. (0i and (0). 

Proof. In [5], we established that the speciation times X2 > ■■■ > Xn-i are the order statistics of 
n — 2 i.i.d. random variables S2,---,s„_2 with probability density g{s\xi). For the distribution, by 
integrating g{s\xi) with respect to s we obtain the function G{s\xi). Now, Xk+i is the fc-tli order 
statistic (with the first order statistic being the largest and the {n — 2)-th order statistic being the 
smallest value), thus we have the probability density function for Xk (e.g. [3]): 

/„,fc+i(s|i) = (n - 1 - fc) f " ^ ^ jG(,s|t)"-^-2(l - G(s|t))'-ig(s|i). 



Equivalently: 



fn,k+iis\t) = in~2)[l_ J JG(s|i)"-'-'(l - G{s\t))'^-'g{s\t). 



n 



Theorem 3.5. The length of a randomly selected pendant edge in a reconstructed birth-death tree on 
n extant species and age Xi has the probability density function: 

fp{s\n,xi) ^2— — n-1 - n-3 

n(n- l)po(a;i) V Pa{xi) J 



for s < xi and: 



where S is the Dirac delta function. 



2 

/p(xi|n,a;i) = — -tS{0), 

n(n — 1) 



Proof. The proof is similar to the proof of Theorem 13. II in |10| . For s ~ xi, the pendant edge must be 
attached to the root which has probabihty 2/{n{n — 1)) (Eqn. ([T|)). The length of the pendant edge is 
xi with mass equal to 1 which is formalized using the Dirac delta function. 
For 3 < xi, we have: 

n-l 

fp{s\n,xi) = ^v{k)fn,k{s\n,xi) 



fc=2 



n-l 



= 2-|-^5(^ki) (2 + 'j2in - 3) (" ^ ^) G(.|xr)"-'^-^(l - G{s\x,)r'] 
= 2^^^5(51x1) (2 + (n-3)(l-G(s|a:i)) 

= 2ip^.g(5ki) ((ri - 1) - (n - 3)G(.|xi)) , 
n[n — 1) 

which establishes the theorem. D 

Evaluating the first moment integral of fp{s\n,xi) yields, 

Corollary 3.6. For < /i < A, the expected length of a pendant edge is: 

E[p\n,x,] = \ (2x1+ , wi''~!l_^w^ g^ 



where 






xipi{xi) (A - e 



-(A-/j)xi 



i ( T-^ (a - e-(^-'')^v) - e-(^-^)^^(n + 1) 



A2 
'''^^' -^>^Pf'^^ (e-(--)-M-4M - (n + 1)(A - A.)) - A(-4A + (n + 1)(A - /.))) . 

For jjL = \, 

E p n, xi = ^ -— 2x1 + , '2 {n - 7 xi + n + 1 po -^i + T log 1 + Aa^i 

Setting /i = in Corollary 13.61 yields Theorem 2 of \TU\ . 

(iii) Conditioning on xi 

Lemma 3.7. The probability of a reconstructed birth-death tree with the first speciation event being 
at time xi having n >2 extant descendants is, 

Pn XI = [n- 1 7^ 7 77^ ■ 

{1~ ^lpo{xl)y 

In particular, the probability p„(xi) can be written as a function of the transformed parameters X, fi 
instead of A, /t, /. 

Proof. The probability of a single indiyidual haying n extant and sampled offspring after time xi, 
giyen that it has at least one offspring is [21] : 

/ - \ n— 1 

^ _ /(A - /i)2e-(^-'^)^i / /A(l-e-(^-'^)^i) ^ /A + (A(l - /) - /i)e-(^-'^)^ 

Pn(^li 



(/A + (A(l - /) - A)e-(^-'')-0' V/^ + (^(1 - /) - A)e-(^-«^i 7 /(A - A) 

Using the transformation of the parameters A and /t to A and p as giyen in Remark II. 11 we can express 
Pnixi) as a function of only A, fi, n, xi: 

^ I X Px{xi)(\pq{x{)Y-'^ 

Pn[Xl) = . 

1 - fipo[Xi) 



This leads to: 

, , "^ Pi(xi)(Apo(xi))'--i Pi(.Ti)(Apo(x-i))"-'--i 

Pn(xi) = > ^ r 

^" ' ii-fm{xiW ■ 

D 

Theorem 3.8. The length of a randomly picked pendant edge in a reconstructed birth-death tree of 
age xi has probability density function for s < xi: 

fp{s\xi) = 2 -——^[h{l,xi) ——h{3,xi) 

Po{xi){l - UPoixijj'' \ Po[Xij 

where 

{k + l)Xpoixi)-k 2fclog[l- Apo(2:i)] 2fc 



/l(fc, Xi) 



{l-Xpo{xi))^ (Xpoixi))^ Xpoixi)' 

and for s ~ xi, 

f,{x,\x,) = -2(log(l - Xpoix,)) + Xp,ix,)) ( ^;^^^^ ) 5{0), 

\Xpo[xi)[l - fipo(xi}}J 

where 6 is the Dirac delta function. In particular, the probability fp{s\xi) can be written as a function 
of the transformed parameters A, fi instead of A, /i, /. 

Proof. Using Theorein l3.5l and Lemma [5771 for s < Xi, we have (using the transformed parameters A, fx 
instead of X,fi,f): 



fp{s\xi) = ^fp{s\n,xi)p{n\xi) 



n— 3 

^ n{n - 1) po{xi) \ Pa{xi)J (I - npo{xi)y 

pi{s)pi{xiy ^ (n-2) f Q^Jfli^Vx f ^^n-2 
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Wc also have: 



hik,xi):= = y2- (n-fc)(Apo(a:i)r-2 

71 — 3 
oo oo (-. 

■^ — ^ ■'^ — ^ n 

n— 3 n— 3 

^ Apo(a:i) k 2fc -A (Apo(a:;i))" 

(l-Apo(xi))2 1-Xpoix,)^ + (Apo(a;i))2 ^3 

(fc + l)Apo(x-i)-fc 2fc (-log[l - Xpojx,)] - Xpojx,) i^Eop})!) 

(l-Apo(xi))2 + '+ (Apo(xi))2 

_ (fc + l)Apo(2;i)-fc 2fclog[l- Apo(a;i)] 2fc 

(1 - Apo(a;i))^ (Apo(2;i))^ Apo(2:i)' 

Overall, this yields: 

f t I \ o Pi(s)pi{xi)^ ( ,,^ . Pais) ,,„ - 

For s = Xi, we obtain fp{xi\xi) in an analogue way, using fp{xi\xi) = X]^2 /p(^il"'i^i)p("-l^i)- ^ 

Evaluating the first moment integral of fp{s\xi), or evaluating X]n=2 ^[pI'^' 3;i]p(n) with E[p|n, xi] 
from Corollary 13.61 provides an analytic expression for the expected length of a randomly chosen 
pendant edge. 

The following theorem establishes a general result under Scenario (iii), namely that a birth-death 
model with incomplete sampling can be transformed into a birth-death model with complete sampling 
when considering reconstructed trees. 

Theorem 3.9. Let T be a reconstructed tree conditioned on xi. The probability density of T having 
speciation times a;2, . . . , a;„_i with parameters X, p., f equals the probability density of T having spe- 
ciation times X2, ■ ■ ■ ,Xn-i with parameters A,/z (where A = /A,/i = ji — A(l — /), as above) under 
complete sampling. 

Proof. The probability density of x is provided in [21], Theorem 3.8, with m = 0, 

2 II- 1 



f{x\x,)=(-^^^\ ■WM.ix..), 
\l-qo{xi)J H- 
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where 



qo[xi) = 1- 



gi(a^i) 



/(A - /i)2e-(^-'^)^i 



(/A + (A(l-/)-/i)e-(^-A)-i)2 
The transformation A = /A, ^ = fi — A(l — /) now cstabhshes the theorem. D 

4 Length of a root edge in a Yule tree 

A reconstructed tree has two edges descending from the root, and we denote these as e' and e" where 
we may assume that e' is shorter than e" (see Fig. [2|) . By a root edge we mean the selection of e' or 
e" with equal probability. We will calculate the length of the root edge in a Yule tree under Scenario 
(i)-(iii). 

We will show that, under Scenario (i), a root edge is longer than a randomly chosen interior edge 
(which has exponential distribution with parameter 2 A, Theorem [ 



(i) Conditioning on n 

Let Xi be an exponentially distributed random variable with parameter iX. Then for fc > 2, Ik 
^,-^2 -^i is the hypo-exponential distribution with probability density: 



fiAt) = E^*e"''* n 



k 

J 



7 — z 

i=2 j=2,j^i -^ 



which can be transformed to 

fc! 



/,,(i) = 5]Ae-^^*(-l)^ 



i=2 

k 



(fc-i)!(i-2)! 



fc(fc-l)^A(-e-^'r(^:^2). (4) 



J-2 

i=2 



Theorem 4.1. The length L of one of the two edges descending from the root (picked uniformly at 
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random) has probability density function: 

hit\n) = Ae-^* (1 - (1 - e-^*)"-'(l - ne"^*)) . 

Proof. In a Yule tree with n extant species, the shorter edge e' of the two edges descending from 
the root has a length L' that is exponentially distributed with parameter 2A. The length L" of the 
longer edge descending from the root, e", is calculated as follows. First note that the waiting time, Xi, 
between the i — 1-th and i-th speciation event is an exponential distribution with parameter iX. Let e" 
terminate at the fc-th speciation event in the tree. The length of A is then Ik with probability density 
given in Eqn. Q. The probability that A terminates at the fc-th speciation event (2 < fc < n) is: 

and the probability that A does not terminate until the present is, 

Pn = l[(l-])=2/in-l). 

The length of e" is: 



.=3 ^ J 



i" = E^feP^' 



fc=3 
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and thus the density function of L" is, 

n 
fL"{t\n) = ^//JOPfe 



n — 1 k 



fc=3 i=2 ^ ^ 2=2 ^ 

n-ln-1 ^, „s n-1 n y „\ 

i=3 fe=i ^ ^ fe=3 i=2 ^ ^ 

i=3 ^ ^ i=2 ^ 

= 2X]A(-e-^*r (('-1^ +"("12)) +2(2n-3)Ae-2^* + 2nA(-e-^*r 

n-l ^ s 

= 2 ^ A(-e-^*)'z ( '"_ I + 2(271 - i)\e~^^' + 2n\[-e~^'T 



i=3 
n-1 



= 2A ^ i{^e-^'y r _ M + 2(2n - SjAe'^^* + 2nA(-e-^*)" 
= 2Ae-^* (1 - e"^* -- (1 - e~^*)"-2(l - ne"^*)) . 

The density of L is, 

fL{t\n) = h'{t)/2 + h"it)/2 

= Ae-^* (1 - (1 - e-^*)"-2(l - ne-^*)) , 

which estabhshes the theorem. D 

Calculating the first moment of L yields: 

Corollary 4.2. The expected length L of one of the two edges descending from the root (selected at 
random) is E[L|7i] = j(l — — ). 

Comparing this result with Corollarv l3.2l and Theorem 13.31 we see that for a Yule tree conditioned 
on having n species, the expected length of one of the two edges descending from the root, selected at 
random, is (asymptotically) twice the length of a randomly selected edge (or of a randomly selected 
pendant edge, or of a randomly selected interior edge) . 
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(iii) Conditioning on xi 

Before eonsidering Scenario (ii), we first consider Scenario (iii), i.e. the time since the root is xi. In a 
Yule tree with the root having age xi, select one of the two edges incident with the root uniformly at 
random (e.g. by a fair coin toss), and let L denote its length (up to time xi). Then L has a discontinuous 
distribution: 

{e^^\ for < / < xi; 
0, for I > xi] 
which implies that E[i|a;i] = i(l — e"'^^^), and so, in particular: 

E[i|xi] = i-o(l), 

where o(l) is a term that goes to zero exponentially fast with xi. 

Note that the distribution of L should not be confused with the truncated exponential distribution 
arising from the conditional probability P(i > l\L < t) , since this is the probability that a speciation 
event occurs on this lineage before or at time xi. 

(ii) Conditioning on n and xi 

We first consider a Yule tree starting from a single lineage at time and grown for time t. Let Kt 
be the number of leaves at time t, and consider the length / of the initial edge up to time t. Let 
P(/ > l\Kt = fc) denote the probability that / is greater than /, conditional on the event that Kt = k. 



Lemma 4.3. 



a''"\ ifl<t; 
\I > l\Kt = k) = 

'O, ifl>t; 



where a — ^ ^_ _y^ — , 



Proof. Let f{s\Kt — k) be the density of / conditional on Kt = k. By Bayes' formula: 
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where 

/(,s) = Ae-^^ 

is the (unconditional) first branch length. The unconditional distribution of Kt is 

¥{Kt = fc) = (1 - e-^^f-^e-^K 
Moreover, it is an easy exercise to show that, for s < i, 

f{Kt = k\I = s) = {k- 1)(1 - e-M*-'*))fc-2g-2A(t-«)_ 

Combining these three expressions into ([5]) gives: 



(l-e-^*)*^- 



ns\K. = k)^ ^---^^ ^-- '—. (6) 



Now, 



(/ > l\Kt = k)= f f{s\Kt = k)ds. (7) 



and so substituting ([6]), applying the substitution m = 1 — e -*'*e^'' and rearranging, one obtains: 



\I >l\Kt = k) 



fc-1 



l-e-"'e" 



u^'-'^du, (8) 





which gives the value a^~^ for I <t. D 

Consider now a Yule tree T and suppose we condition on both n and xi. Select uniformly at random 
one of the edges of T that are incident with the root, let L denote its length up to time xi, and let 
P(L > /|n,.Ti) denote the probability that L > I, conditional on T having n leaves at time xi. 



Theorem 4.4. 



l-a" 



for I < Xi] 



F{L>l\n,xi)^ ^ « 1 V 1 " / (g) 

0, for I > xi. 

Proof. It is a fundamental property of the Yule model that the number of leaves beneath the selected 
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edge has a uniform distribution between 1 and n — 1. Thus, if / < xi then: 

-. n — 1 

F{L> l\n,xi) = ^P(/>/|i^^i =k) 



k=l 



and the result now follows directly by Lemma 14731 D 

Let us now replace A by its maximum likelihood (ML) estimate Xml ~ lii(§)/a;i and evaluate the 
expression for P{L > t\n,xi) as ri -> oo. Note that if we let A = \n{^)/t then 



1-^e 



2 ^\l 



a 



1-^ ' 

n 

and so: 

.-1 ^ (1 - T^'r-' exp(-2e^0 

where A — Xml ~ InC'f )/^ ^-^d where ^ denotes asymptotic equivalence as n — > cxd. Moreover, 

1 1 n 1-^ 1 



n-1 \-a 2(n - 1) e-^' - 1 2(e^' - 1) 

So, from ([9]), we obtain: 

1 - er"^ 
P(i > l\n,xx) - 



w 

where w — 2(e^' — 1) and where ^ denotes asymptotic equivalence as n — ^ oo. 

To determine the expectation of L conditional on ti, x\ we simply integrate this expression from 
L — Q Xo L — x\ with respect to I (using the well known identity that E[X] ~ L P{X > x)dx for a 
non- negative continuous random variable X). Noting that dl = xi2+w'\ ' ^'^ obtain the following result 
which exhibits a different limit to the value 2-^ for a randomly selected edge of T or the limiting value 
y described above when we just condition on xi. 

Corollary 4.5. The expected length 0/ one of the two randomly selected root edges, conditional on n 
and Xi, and with A set equal to its ML value, converges to j- as n —> 00 where: 

da; = 0.8158... 



/o x{2 + x) 
It is instructive to compare this result with Corollary 
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5 Diversity in a Yule tree 

In this section, we calculate the sum of all edge lengths in a Yule tree which is also called the diversity. 

(i) Conditioning on n 

Theorem 5.1. The sum D of all branches in a Yule tree with n >2 leaves has a gamma distribution 
with density function: 

\n—lp—Xdjn—2 

In particular, D has mean ¥,[D\n] = ^^^ and variance Var[D|n] ~ ^^^ji and is asymptotically normally 
distributed. 

Proof. The sum of all edge lengths in a Yule tree with n leaves is D ^ X]?=2 ^-^j where Xj has 
an exponential distribution with parameter Xj. Note that jXj has an exponential distribution with 
parameter ^ ■ Xj = X, and so D is a sum of n — 1 independent exponential random variables, each 
having parameter A, and so D has the claimed gamma distribution. 

D 

(ii) Conditioning on n and Xi 

As shown in [5], a Yule tree of age xi with n > 3 leaves, the n ~ 2 speciation events ^2, . . . , Sn-i 
descending from the root are i.i.d. random variables with density: 

9(s) 



1 — e^'^^i 



The sum of all branch lengths is the sum of the n — 2 speciation times and 2x1 (accounting for the 
two branches descending from the root). 

Theorem 5.2. The sum D of all branches in a Yule tree of age xi and n leaves has moment generating 

function: 

/ AH - p(^-^)^M \"~^ 
""""••-■■' --'-" ( (A -.,),; -.-i) ) ^ 

Proof Let C denote the sum of the n — 2 speciation times 5*2, ... , Sn-i- Then C is a convolution of 
n — 2 random variables with probability density 17(5) = j^^^-axi ■ If Mc{s\n,xi) denotes the moment 
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n-1 



generating function for C , then: 

Mc{s\n, xi) = ¥.[e^'] = E[e^^=^' ^^1 = H "^[^^''l 






1 — p^^^i , 



A(l - e('^-^)^0 ^ " ^ 
(A-s)(l-e-^^i) 



Now D = 2xi + C and so A/D(s|n,xi) = e^^i'*Mc(s|n,a;i), which leads directly to the expression 
claimed. D 

(iii) Conditioning on xi 

The sum D of all branches in a Yule tree of age xi has probability density function, 



jD{d\xi) = ^ pn{xi)fD{d\n, Xl), 



n=2 

where fD{d\n,xi) is the density conditional on n and xi. In particular, noting that E[£'|n, xi] 
■£MDis\n,xi)\s=o, we have: 



CXD , 

l[D\xi] = y^^Pn{xi) — MD{s\n,xi)\s=o, 



which gives: 



ds 

n—2 



E[D\x,] - ^(e^-i - 1), 



a result that was derived via a different argument in 



6 Conclusion 



In this paper, we derive the probability density and expectation for the length of a randomly picked 
pendant edge of a reconstructed birth-death tree with random sampling of extant individuals. We 
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investigate this under three scenarios in which the resulting reconstructed trees induced by the birth- 
death process are finite: we either consider trees (i) with a fixed number of leaves, (ii) with a fixed 
number of leaves and a fixed age of the most recent common ancestor of the extant sampled species, 
or (iii) with a fixed age of the most recent common ancestor of the extant sampled species. 

We first noted that under our three Scenarios (i)-(iii), the original process with sampling (pa- 
rameters A, /i, /) can be transformed into a birth-death process and complete sampling through 
A = /A, /i = fi — A(l — /). Hence, we state all results as functions of the transformed parameters 

For the Yule model we further determine under our three Scenarios (i)-(iii) the probability density 
for the length of an edge descending from the root and the sum of all edges, which is also known 
as the diversity. In particular, we show that the pendant as well as the interior edge lengths under 
Scenario (i) are exponentially distributed with parameter 2A while an edge descending from the root is 
asymptotically twice as long compared to a randomly chosen edge. Furthermore, the diversity follows 
a gamma distribution. 

Knowledge of the branch lengths and diversity in a phylogenetic tree is important for conservation 
strategies. When present-day species become extinct, short pendant edges cause little diversity loss 
while long pendant edges cause severe diversity loss. In previous work, the expected loss of diversity 
was considered under constant rate birth-death models with mild extinction |10| , and under constant 
rate birth-death models with severe extinction, namely A = /x |13| (note that |13| derived the results 
under the coalescent with constant population size; in expectation, such coalescent trees equal birth- 
death trees with A = /x [B]). Characterizing the full distribution of the loss of diversity remains an open 
task. Our results yielding the branch length and diversity distribution might be a first step towards 
characterizing the distribution of diversity loss (rather than only the expectation). Knowledge of the 
distribution will help understand the stochastic variability of diversity loss. An asymptotic normal law 
for diversity loss was recently established by [J , but this was established only for certain deterministic 
classes trees, rather than for trees generated by a stochastic process. 

For future work, it would be interesting to compare the branch lengths and diversity (loss) under 
the constant rate birth-death process to models under which the rates of spcciation and extinction 
may vary. In particular, speciation and extinction rates may dependent on the number of species that 
are extant at that moment (i.e. density-dependent speciation) |17j . or on time (i.e. environmental- 
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dependent speciation) |14[|22) . or on a particular trait (i.e. trait-dcpcndcnt spcciation) [3]. 
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